EntityRank: Searching Entities Directly and Holistically
نویسندگان
چکیده
As the Web has evolved into a data-rich repository, with the standard “page view,” current search engines are becoming increasingly inadequate for a wide range of query tasks. While we often search for various data “entities” (e.g., phone number, paper PDF, date), today’s engines only take us indirectly to pages. While entities appear in many pages, current engines only find each page individually. Toward searching directly and holistically for finding information of finer granularity, we study the problem of entity search, a significant departure from traditional document retrieval. We focus on the core challenge of ranking entities, by distilling its underlying conceptual model Impression Model and developing a probabilistic ranking framework, EntityRank, that is able to seamlessly integrate both local and global information in ranking. We evaluate our online prototype over a 2TB Web corpus, and show that EntityRank performs effectively.
منابع مشابه
Entity-Supported Summarization of Biomedical Abstracts
The increasing amount of biomedical information that is available for researchers and clinicians makes it harder to quickly find the right information. Automatic summarization of multiple texts can provide summaries specific to the user’s information needs. In this paper we look into the use named-entity recognition for graph-based summarization. We extend the LexRank algorithm with information...
متن کاملInvestigating the Impact of Taxes on the Income of Legal Entities (Companies) on the Cost of Urban And Rural Households using Input-output analysis
Current situation of Iran Economy which is under increasing sanction can be affected negatively and followed by many consequences for the society through any misleading economic decision from government side.The Iranian tax system includes various types of taxes that are directly and indirectly collected, each of them have its own effects on society and market system. Determining the way of col...
متن کاملHow consumers search for health information
To date most of the research concerning consumer health information has focused on trust and quality of health information websites. In this research, we observed 48 consumers searching for four health-related topics (some of their own choosing) using Google. Using transaction logs, video screen capture, retrospective verbal protocols and self-reported questionnaires, we examined holistically t...
متن کاملMethodology for Searching Entities on the Web
The Semantic Web is driven by the idea of moving from a Web of documents, designed for human consumption, to a Web of data in order to “create a universal medium for the exchange of data where data can be shared and processed by automated tools as well as by people”1. Nowadays, more and more machine-readable annotations and meta-data are available on the Web. This data, typically codified using...
متن کاملEntity Linking for Queries by Searching Wikipedia Sentences
We present a simple yet effective approach for linking entities in queries. The key idea is to search sentences similar to a query from Wikipedia articles and directly use the human-annotated entities in the similar sentences as candidate entities for the query. Then, we employ a rich set of features, such as link-probability, contextmatching, word embeddings, and relatedness among candidate en...
متن کامل